Bias toward Higher Performance for the Decomposed Network. Whether a Decomposed Network Would Learn

نویسندگان

  • References Fisher
  • D E Rumelhart
  • G E Hinton
  • R J Williams
چکیده

faster than a non-decomposed network when the number of weights in the two networks is approximately equal remains to be tested. Second, we performed a more extensive search in and space than [Kamm and Singhal, 1990], resulting in generally higher learning rates and training times of only a few epochs. The range of learning rates explored in the current study included higher rates than those explored in [Kamm and Singhal, 1990], so it is dicult to determine whether the learning speed-up (in terms of arithmetic operations) is attributable to problem decomposition or learning rate. To address this issue, we also trained monolithic 125ms networks for approximately the same number of operations as the decomposed net, using a) the learning parameters used for the 125ms subnetwork and b) the original parameters for the monolithic network. On the 200-sentence-test set, these networks had normalized AHAs of 47.3% and 46.3% and normalized AFAs of 5.9% and 5.1%, respectively. This performance is markedly poorer than performance of the decomposed network, supporting the hypothesis that the problem decomposition strategy was a primary contributor to the learning speed-up. The problem decomposition strategy facilitated the more thorough search for learning parameters, since it allowed the use of dierent parameters for training the dierent subnetworks. In many empirical studies utilizing the back-propagation algorithm, it is dicult to control for the and parameters. Often, they are held constant over all experimental conditions. However, since optimal parameter values are often problem-dependent, this constraint can result in misleading comparisons if and are chosen optimally for one network but not another. Our experience in this project supports the fact that the careful selection of network training parameters is critical for successful training (and performance). 6 Summary Problem decomposition was applied to AP-net, a neural network for mapping acoustic spectra to phoneme classes. AP-net was decomposed into subnetworks diering in both output units (assigning phoneme classes to subnetworks based on their durations) and input units (providing dierent input spans to the subnetworks). Subnetworks were trained individually and then combined into a larger network using \glue" units and ne-tuning of the entire network. This problem decomposition strategy achieved comparable performance to that of a monolithic network, but required an order-of-magnitude fewer arithmetic operations during training. This result provides evidence for the general utility of this problem decomposition methodology for addressing the learning speed problem in back-propagation. Table 1 summarizes the conditions used to train …

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Evaluating Subunits Importance in Performance Measurement of Network Systems in Data Envelopment Analysis

In conventional DEA models, decision making units (DMUs) are generally assumed as a black-box while the performance of decision making sub-units (DMSUs) and their importance play crucial roles in  analyzing the performance of systems which have internal processes. The present paper introduces an ideal network which have efficient processes and next purposes a new approach for evaluating importa...

متن کامل

Hybrid Models Performance Assessment to Predict Flow of Gamasyab River

Awareness of the level of river flow and its fluctuations at different times is one of the significant factor to achieve sustainable development for water resource issues. Therefore, the present study two hybrid models, Wavelet- Adaptive Neural Fuzzy Interference System (WANFIS) and Wavelet- Artificial Neural Network (WANN) are used for flow prediction of Gamasyab River (Nahavand, Hamedan, Iran...

متن کامل

Hybrid Models Performance Assessment to Predict Flow of Gamasyab River

Awareness of the level of river flow and its fluctuations at different times is one of the significant factor to achieve sustainable development for water resource issues. Therefore, the present study two hybrid models, Wavelet- Adaptive Neural Fuzzy Interference System (WANFIS) and Wavelet- Artificial Neural Network (WANN) are used for flow prediction of Gamasyab River (Nahavand, Hamedan, Iran...

متن کامل

Generalization of Decomposed Integration Methods for Cost Effective Heat Exchanger Networks with Multiple Cost Laws

At many circumstances, in heat exchange processes several exchangers were used with different cost laws due to their pressure ratings, materials of construction and exchange3r types. In such circumstances traditional methods of pinch technology can not be led to minimum total annual cost may cause some other disadvantages like more complexity or higher maintenance. In this research work a n...

متن کامل

تحلیل آشوب، تجزیۀ موجک و شبکۀ عصبی در پیش‌بینی شاخص بورس تهران

This study investigates predictability, chaos analysis, wavelet decomposition and the performance of neural network models in forecasting the return series of the Tehran Stock Exchange Index (TEDPIX). For this purpose, the daily data from April 24, 2009 to May 3, 2012 is used. Results show that TEDPIX series is chaotic and predictable with nonlinear effect. Also, according to obtained inverse o...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 1991